DIASUMM: Flexible Summarization of Spontaneous Dialogues in Unrestricted Domains

نویسندگان

  • Klaus Zechner
  • Alexander H. Waibel
چکیده

In this paper, we present a summa.rization system for spontaneous dialogues which consists of a novel multi-stage architectm'e. It is specifically aimed at addressing issues related to tlle nature of the l;exts being spoken vs. written and being diMogical vs. monologica.l. The system is embedded in a. graphical user interface ~md was developed and tested on transcripts of recorded telephone conversations in English and Spanish (CAI,LHOMI,;).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Speech-Specific Characteristics for Automatic Speech Summarization

In this thesis we address the challenge of automatically summarizing spontaneous, multi-party spoken dialogues. The experimental hypothesis is that it is advantageous when summarizing such meeting speech to exploit a variety of speech-specific characteristics, rather than simply treating the task as text summarization with a noisy transcript. We begin by investigating which term-weighting metri...

متن کامل

Domain adaptation with augmented space method for multi-domain contact center dialogue summarization

In this paper we propose a method to improve the quality of extractive summarization for contact center dialogues in various domains by making use of training samples whose domains are different from that of the test samples. Since preparing sufficient numbers of training samples for each domain is too expensive, we leverage references from many different domains and employ the Augmented Space ...

متن کامل

Learning to Model Domain-Specific Utterance Sequences for Extractive Summarization of Contact Center Dialogues

This paper proposes a novel extractive summarization method for contact center dialogues. We use a particular type of hidden Markov model (HMM) called Class Speaker HMM (CSHMM), which processes operator/caller utterance sequences of multiple domains simultaneously to model domain-specific utterance sequences and common (domainwide) sequences at the same time. We applied the CSHMM to call summar...

متن کامل

Turn Segmentation into Utterances for Arabic Spontaneous Dialogues and Instance Messages

Text segmentation task is an essential processing task for many of Natural Language Processing (NLP) such as text summarization, text translation, dialogue language understanding, among others. Turns segmentation considered the key player in dialogue understanding task for building automatic HumanComputer systems. In this paper, we introduce a novel approach to turn segmentation into utterances...

متن کامل

A flexible formal language for the orthographic transcription of spontaneous spoken dialogues

Orthographic transcriptions of speech are important in most fields of research concerned with spoken language. For spontaneous speech they have to be created manually, resulting potentially in inconsistent or erroneous transcriptions. We propose a new flexible and easy-to-use formal language for the orthographic transcription of spontaneous speech. All relevant phenomena introduced by spontaneo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000